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Abstract 

Background: Xylose isomerase (XI) catalyses the isomerisation of xylose to xylulose in bacteria and some fungi. 
Currently, only a limited number of XI genes have been functionally expressed in Soccharomyces cerevisioe, the 
microorganism of choice for lignocellulosic ethanol production. The objective of the present study was to search 
for novel XI genes in the vastly diverse microbial habitat present in soil. As the exploitation of microbial diversity is 
impaired by the ability to cultivate soil microorganisms under standard laboratory conditions, a metagenomic 
approach, consisting of total DNA extraction from a given environment followed by cloning of DNA into suitable 
vectors, was undertaken. 

Results: A soil metagenomic library was constructed and two screening methods based on protein sequence 
similarity and enzyme activity were investigated to isolate novel XI encoding genes. These two screening 
approaches identified the xyml and xym2 genes, respectively. Sequence and phylogenetic analyses revealed that 
the genes shared 67% similarity and belonged to different bacterial groups. When xyml and xym2 were 
overexpressed in a xyM-deficient Escherichia coli strain, similar growth rates to those in which the Piromyces XI gene 
was expressed were obtained. However, expression in 5. cerevisioe resulted in only one-fourth the growth rate of 
that obtained for the strain expressing the Piromyces XI gene. 

Conclusions: For the first time, the screening of a soil metagenomic library in E coli resulted in the successful 
isolation of two active Xls. However, the discrepancy between XI enzyme performance in E coli and 5. cerevisioe 
suggests that future screening for XI activity from soil should be pursued directly using yeast as a host. 
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Background 

The soil habitat is an immensely diverse environment. 
One gram of soil may harbour up to ten billion bacteria 
belonging to more than 4,000 to 7,000 different species 
[1], although this value varies according to the soil type 
[2]. However, only 1% of the soil bacterial flora can be 
cultivated under standard laboratory conditions [3]. 
Metagenomics, which consists of the extraction, cloning 
and analysis of the entire genetic material in a given 
habitat [4,5], has emerged as a tool for assessing the 
genetic information from uncultivable microorganisms. 
Metagenomic libraries are a powerful source for disco- 
vering new biological activities and have already been 
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successfully used to isolate novel hydrolytic enzymes 
such as amylases, cellulases and xylanases [6,7]. 

The screening of metagenomic libraries, which is a 
critical step for the successful isolation of new and 
improved biological activities, can be performed by 
sequence homology or activity-based assays [8]. In 
sequence-based screening, the isolation of novel proteins 
is based on homology searches. Although it impedes the 
discovery of entirely new gene sequences, it is an effi- 
cient method of isolating enzymes with different levels 
of similarity from previously identified genes. For 
instance, sequence-based screening from the metagen- 
ome of a hindgut microbiota of a wood-feeding higher 
termite identified 700 domains with homology to the 
glycoside-hydrolase catalytic site [9]. In activity-based 
assays, metagenomic libraries are constructed in expres- 
sion vectors to allow for the isolation of biological 
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activities from entirely new gene sequences that are also 
successfully expressed in the chosen host. 

Activity-based screening methods have been categor- 
ized into chromogenic, product detection and growth 
test assays [10]. Chromogenic assays are commonly 
used for the detection of hydrolases from metagenomic 
libraries [6,7], whereas a product detection method has 
been utilized for the identification of enzymes from a 
soil metagenome producing porphyrin intermediates 
[11]. Although it is a high-throughput method, 
growth-based screening has not been frequently 
described, as it may be difficult to develop and imple- 
ment a selection protocol that clearly discriminates 
growth from nongrowth. 

Metagenomic libraries have not yet been utilized for 
the isolation of novel genes involved in xylose catabo- 
lism. Xylose is the second most abundant sugar in nat- 
ure, and the production of cost-efficient lignocellulosic 
bioethanol requires its complete fermentation [12]. One 
xylose catabolic pathway is found in naturally xylose-uti- 
lizing bacteria [13] and some fungi [14,15] and includes 
the isomerisation of xylose to xylulose via xylose isomer- 
ase (XI). High ethanol yields from xylose have been 
reported for Saccharomyces cerevisiae strains harbouring 
actively expressed XI genes [15-18]. 

In this study, the possibility of isolating xylose cata- 
bolic enzymes from a soil metagenomic library was eval- 
uated. Both sequence-based and growth-based screening 
were utilized. Each method resulted in the isolation of a 
novel gene sequence coding for a XI. Restoration of 
growth on xylose was verified in both Escherichia coli 
and S. cerevisiae strains by overexpression of the isolated 
genes in strains lacking the initial xylose catabolic path- 
way. The advantages and disadvantages of the two 
screening methods are discussed. 

Results 

Isolation of XI genes from soil metagenome 

A soil metagenomic library with an estimated size of 
1.26 x 10 5 clones was constructed in plasmid pRSETB 
(Table 1) using DNA extracted from garden compost 
and screened for XI activity using either sequence analy- 
sis or growth assays. 

For the sequence-based screening, degenerate primers 
were designed based on the alignment of amino acid 
sequences of known XIs from soil microorganisms such 
as Streptomyces sp. and Rhizobium sp. Amino acid 
sequences of XI genes whose expression has been 
attempted in S. cerevisiae, such as the ones from Piro- 
myces sp. [14] and E. coli [19], were also included 
(Figure 1). The first polymerase chain reaction (PCR) 
was performed with primers DF1 and DR2 (Table 2) 
and with the soil metagenomic library used as a tem- 
plate. A fragment of approximately 750 bp was 



Table 1 Plasmids and strains used in this study 



Plasmids/ 
strains 



Relevant characteristics 



Source 



Plasmids 



pGEM-T 
Easy 


PCR product cloning, ampR 


Promega 
(Madison, Wl, 
USA) 


pRSETB 


ampR, T7p-T7t 


Invitrogen 
(Carlsbad, CA, 
USA) 


p426TEF 


URA3 TEFp-CYC1t 


[43] 


pRESTB- 
XlPiromyces 


AmpR, T7p-xiPiromyces-T7t 


This study 


pRSETB- 
Xym1 


AmpR, T7p-xym7-T7t 


This study 


pRSETB- 
Xym2 


AmpR, T7p-xym2-T7t 


This study 


p426TEF- 
XiPiromyces 


URA3, p426TEFp-XiPiromyces- 
CYC1t 


This study 


p426TEF- 
Xym1 


URA3, p426TEFp-xym7-CYC1t 


This study 


p426TEF- 
Xym2 


URA3, p426TEFp-xym2-CYC1t 


This study 



Escherichia coli 
strains 

Ele 
Blue 

HB101 

DH5a 

TMB2010 
TMB201 1 
TMB2012 
TMB2013 



Saccharomyces 
cerevisiae strains 

TMB3044 



TMB3363 
TMB3359 
TMB3364 
TMB3365 



ara-14, proA2, xyl-5, leuB6, thi-1 
Sm R 

HB101 + pRSETB 
HB101 + pRSETB-xiPiromyces 
HB101 + pRSETB-Xyml 
HB101 + pRSETB-Xym2 



CEN.PK 2-1C Agre3, his3:.PGK1p- 
XKS1-PGKh,TAL1::PGKlp-TAL1- 
PGK1t,TKL Iv.PGKl p-TKL 1-PGKh, 
RKI1::PGK1p-RKI1-PGK1t, RPElv. 
PGK1p-RPE1-PGK1t,ura3 

TMB3044, p426TEF 

TMB3044, p426TEF-XiPiromyces 

TMB3044, p426TEF-Xym1 

TMB3044, p426TEF-Xym2 



Stratagene (La 
Jolla, California, 
USA) 

Takara Bio (Otsu, 
Shiga, Japan) 

Life Technologies 
(Rockville, MD, 
USA) 

This study 
This study 
This study 
This study 



[26] 



This study 
This study 
This study 
This study 



amplified. The purified fragment was then utilized as 
the template for a nested PCR with internal primers 
DF2 and DR1. A single fragment of about 350 bp was 
obtained, purified from an agarose gel and cloned into 
the pGEM-T Easy Vector System (Promega, Madison, 
WI, USA) (Table 1). Sequence analysis revealed that this 
fragment belonged to a gene coding for a XL The 
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Fl 



F2 



Rl 



R2 



1st forward 

TTT|TTTT|TTT 

110 

Piromyces qkIgipyycfhd' 
Streptomyces ddaakvhef 1 

HK HVPFYCFHD' 
SL GAPYYCFHD. 

Rizobium sk gidyysfhd 1 
ag gahgvtfhd: 
tk gvpyycfhd: 
Xantomonas tk gapyycfhd: 

HK NVPYYCFHD' 
SL GAPYFCFHD. 

ClostridiurriEK gapffafhd: 



Xylose isomer ase alinhagment 



2nd forward 



150 

gikllwstaIv 
gvri gains t 
gvkllwgta c 
gtkllwgta l 
gikllwgta c 
gmavpmatt l 
gikllwgta l 
gvkllwgta l 
gvkllwgta c 
gvkllwgta l 
pvkllwgtt 



1st Reverse 




2nd reverse 



340 

]IIRGGGFVTGG r 
IVVRGG FTSDVi 

]ilkag|fttgg: 

IVLAGG FKTGG 1 
IILKAGGFTHGG^ 
JliLESAGYE- GP] 

^vlrqg^lapgg: 

'VLRAG PQGGG 1 
3IIKAG FTTGG: 

^ilqag fktgg^ 

IVLKAG LKKGS: 



Figure 1 Conserved regions utilized for the construction of the degenerate primers. The choice of regions was based on multiple 
alignments of XI amino acid sequences. The microorganisms chosen, in presented order, are Piromyces sp., Streptomyces diostoticus, Escherichia 
coli, Pseudomonos sp., Rhizobium sp., Streptomyces coelicolor, Socchorophogus sp., Enterobocter sp., Mesorhizobium loti and Clostridium sp. 



complete gene sequence was then obtained by PCR uti- 
lizing the metagenomic library as the template and pri- 
mers designed to the sequence of the known 350-bp 
fragment and in plasmid pRSETB. The gene sequence 
encoded a 443-amino acid protein with 83% identity 
with Sorangium cellulosum XI and was designated xyml. 

For the activity screening, the metagenomic library 
was transferred to the host strain HB101 (Table 1) that 
has a nonfunctional xylA, which allows selection for 
growth on xylose. After bacterial transformation, cells 
were plated in defined media supplemented with xylose 
and incubated at 37°C for four to ten days. Positive 
(pRSETB-Xi Piromyces) and negative (pRSETB) controls 
were also transformed into the screening strain and pla- 
ted in selective media to validate the screening method. 
Plasmids were extracted and sequenced from colonies 
that grew on the selective media. All extracted plasmids 
contained identical inserted fragments with an insert 
size of approximately 2.5 kb. Sequencing analysis 
revealed a complete open reading frame that corre- 
sponded to a XI gene named xym2. xym2 encoded a 

Table 2 Primers used 9 

Primer Sequence 



protein of 442 amino acids with 73% similarity to the XI 
from Robiginitalea biformata and 67% identity with 
.rymi -encoded XI. A phylogenetic tree constructed 
using MEGA4 software [20] and 22 different XI amino 
acid sequences revealed that xyml -encoded XI was most 
similar to XIs from the Proteobacteria, while xym2- 
encoded XI was more similar to the XIs from members 
of the Bacterioides phylum (Figure 2). 

Growth evaluation in E. coli 

The xyml and xym2 genes and the positive control xylA 
gene from Piromyces [14] were each cloned into the 
multicopy vector pRSETB, generating plasmids pRSETB- 
Xyml, pRSETB-Xym2 and pRSETB-XiPiromyces (Table 
1). Strain HB101 was transformed with each of the 
three constructs, as well as the negative control 
pRSETB, to generate the four strains: TMB2010 (nega- 
tive control), TMB2011 (Piromyces xylA), TMB2012 
(xyml) and TMB2013 (xym2). 

Growth was evaluated for the four strains in SM3 
liquid media with xylose. Growth with xylose as the sole 



DF1 5' AAGCT(A/C/G/T)GG(A/C/G/T)G(C/T)(A/C/G/T)CCT T(C/T)(C/T)TATOm(C/T)CACGAC 3' 

DR2 5'TCCTCC(A/C/G/T)TGCTG(C/T)(T/A)(T/A)GCCGCC(A/C/G/T)CC(A/C/G/T)CG(A/C/G/T)^ 

DF2 5' GGT(A/G)T(A/C/G/T)AA(A/G)CT(A/C/G/T)CTCTGGGG(A/C/G/T)CAGGCCAA(C/T)CT(A/C/G/T)^(C/T) 3' 

DR1 5' (A/G)TG(C/T)TOTGGGG(C/T)TC(C/T)TGCGGOTGGGCTC (A/C/G/T)A(T/G) 3' 

pRSETBF 5' GGTGGACAGCAAATGGGTCGG 3' 

pRSETBR 5' GGGCmGTOGCAGCCGGATC 3' 

XYM1 F 5' CC TGGATCC ATGAGCGTO^OTGGCGACAAAG 3' 

XYM 1 R 5' TGG GTCGAOT ACCGGATCCACCGGTCATAATG 3' 

XYM2F 5'CAG GAATTC ATGAAAGTACCGTAGGAGACAAGG 3' 

XYM2R 5' TGGGTCGACTOAATAAACCTGCTAATCAAATOTCAATATAC 3' 



Restriction sites, when present, are underlined. 
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— S.degradans 

— S.cellulosum 

— XYM1 

— C.japonicus 

— V. bacterium 

— F.johnsoniae UW101 

— Algoriphagus sp, 

— XYM2 

— R.biformata 

— P.distasonis 

— B.ovatus 

— Orpinomyces 

— Piromyces 

— Xcampestris 

— E.coli 

— P.syringae 

— A.thaliana 

— C.thermosaccharolyticum 

— B.subtilis 

— B,megaterium 

— T.thermophilus 

— C.phytofermentans 

Figure 2 Neighbour-joining tree showing the phylogenetic positions of xy ml- and xy/n2-encoded Xls based on conserved amino acids 
of different Xls. Microorganism sources of Xls were chosen based on sequence similarity and/or active expression in Soccharomyces cerevisiae. 
The phylogenetic tree was constructed with MEGA software using an unweighted group method with arithmetic mean (UPGMA) bootstrapped 
with 6,000 interactions. 
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carbon source was reestablished for all strains overex- 
pressing any of the Xl-encoding genes (Figure 3). No 
significant differences in growth rates and final Optical 
density (OD) were observed between the positive con- 
trol strain, TMB2011, and the strains overexpressing 
xyml and xym2 that were isolated from the metage- 
nomic library (Figure 3). 

Growth evaluation in S. cerevisiae 

XI is a key enzyme for the construction of recombinant 
S. cerevisiae strains for xylose conversion to ethanol. 
However, despite many efforts [19,21-24], only a few XI 
genes have been successfully expressed in S. cerevisiae at 
sufficient levels to allow growth on xylose [15,16,25]. 
Therefore, xyml and xym2 were overexpressed in S. cer- 
evisiae to assess growth in a recombinant S. cerevisiae 
strain lacking the initial xylose catabolic pathway, xyml, 
xym2 and the xylA gene from Piromyces sp. (positive 
control) were cloned in the multicopy vector p426TEF 



to generate plasmids p246TEFXiPiromyces, p426TEF- 
Xyml and p426TEF-Xym2 (Table 1). The S. cerevisiae 
screening strain TMB3044 that has been optimised for 
efficient pentose utilization, but which lacks the initial 
conversion step from xylose to xylulose [26], was trans- 
formed with the three constructed plasmids in addition 
to the empty plasmid to generate strains TMB3363 
(empty plasmid), TMB3359(XiPiromyces), TMB3364 
(xyml) and TMB3365 (xym2). No growth was observed 
for the negative control. In contrast, overexpression of 
both xyml and xym2 enabled growth on xylose in S. 
cerevisiae (Figure 4). However, the strains carrying either 
xymlor xym2 grew at a lower growth rate (0.021 hour" 1 
for both strains) than the positive control harbouring 
Piromyces sp XI (0.069 hour" 1 ± 0.006) (Figure 4). 

Evaluation of Xl-specific activities in both hosts 

Xl-specific activities were compared in cell extracts of E. 
coli and S. cerevisiae cells grown in defined medium 



Parachin and Gorwa-Grauslund Biotechnology for Biofuels 201 1, 4:9 
http://www.biotechnologyforbiofuels.eom/content/4/1/9 



Page 5 of 10 



strain, the specific activity (0.420 U/mg protein" 1 ) was in 
the same range as the ranges for the .rym 7 -over expres- 
sing strain and the positive control in E. coli. In con- 
trast, the specific activity in the #j/m2-overexpressing 
strain was at the level of the background activity in S. 
cerevisiae (0.195 U/mg protein" 1 ) (Figure 5). 

Discussion 

In this study, for the first time, two screening methods 
were established in parallel for the isolation of novel XI- 
encoding genes from a soil metagenomic library. The 
sequence-based method using degenerate primers 
deduced from the alignment of amino acid sequences of 
several XIs from soil microorganisms rapidly resulted in 
the isolation of the first Xl-encoding gene xyml from a 
metagenomic library. Growth-based screening was also 
used for the first time to isolate the Xl-encoding gene 
xym2 from soil metagenomic library using an E. coli 
strain lacking XI activity. The two different methods 
resulted in the isolation of two Xl-encoding genes that 
shared 67% identity at the protein level and could both 
complement E. coli and S. cerevisiae strains lacking a 
xylose catabolic enzyme. 

Some examples where the sequence-based screening 
method has already been used include (1) the identifica- 
tion of genes responsible for the hydrolytic dehalogena- 
tion of 4-chlorobenxoate from a metagenomic library 
constructed from a denitrifying, degrading consortium 
[27] and (2) the isolation of a polyketide synthase gene 
from a metagenome constructed from marine sediment 
in the East China Sea [28]. It is a high-throughput 
screening method, but one disadvantage is that it selects 
for gene sequences with close homology to those 




Time(h) 

Figure 4 Aerobic growth of S. cerevisiae recombinant strains carrying p426TEF-Xym1 (filled triangle), p426TEF-Xym2 (filled circle), 
p426TEF-XiPiromyces (filled square) or the empty plasmid p426TEF (filled diamond). All strains were cultivated at 30°C in mineral media 
supplemented with 50 g/L xylose. 




Time (h) 

Figure 3 Aerobic growth of E. coli recombinant strains carrying 
pRSETB-Xyml (filled triangle), pRSETB-Xym2 (filled circle), 
pRSETB-XIPiromyces (filled square), or the empty plasmid 
pRSETB (filled diamond). All strains were cultivated at 37°C in SM3 
media supplemented with 30 g/L xylose. 



with glucose. For each microorganism, positive (Piro- 
myces sp. XI) and negative (empty vector) control strains 
were included for comparison with the strains overex- 
pressing either xyml or xym2. 

In S. cerevisiae, the background activity measured in 
the negative control (0.195 U/mg protein" 1 ) was twice as 
high as that measured in E. coli (0.059 U/mg protein" 1 ) 
(Figure 5). Strains overexpressing the positive control or 
xyml had about the same specific XI activity in both E. 
coli and S. cerevisiae. For the ,r)/m2-overexpressing 
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Figure 5 XI specific activity (U/mg protein" 1 ) in E. coli (gray) 
and S. cerevisiae (stripped) strains overexpressing xyml, xym2 
and Piromyces sp. XI gene. The background activity was obtained 
from the same strains carrying the respective empty vectors. 



utilized for the design of the degenerate primers. This 
was notably exemplified by the isolation of closely 
related chitinase genes from cultured and uncultured 
marine bacteria [29]. 

In contrast, activity-based screening permits the iden- 
tification of previously unknown gene sequences, as the 
screening is based on the gene product only. One disad- 
vantage, however, is that activity-based screening that 
relies on chromogenic or fluorogenic assays often 
require high-throughput equipment, such as colony 
pickers, that transfer colonies from plates to the assay 
system. When possible, it is therefore advisable to 
develop a growth-based screening method because the 
method is simple (each library can be plated on a lim- 
ited number of plates, as very few clones will grow) and 
enables high-throughput (only the clones displaying the 
desired activity are able to grow). This method is limited 
in that a host strain or an experimental setup must be 
designed that does not enable the host to grow unless 
the desired activity is present. 

Screening by growth has previously been used to iso- 
late genes from a soil metagenomic library conferring 
Na + (Li + )/H + antiporter activity to an antiporter-deficient 
E. coli host strain [30]. Novel DNA polymerase activities 
have also been isolated from glacial ice metagenomic 
libraries using a cold-sensitive E. coli strain that har- 
boured a mutation in the DNA polymerase domain gen- 
erating lethality at temperatures below 20°C [31]. 
Finally, a novel D-3-hydroxybutyrate short-chain dehy- 
drogenase with less than 35% similarity to known pro- 
teins with similar activity has been isolated from soil 
metagenomic libraries using a mutant of Sinorhizobium 
meliloti that was unable to synthesize D-3-hydroxybuty- 
rate [32]. In the present study, xyml- and ,r)/m2-encoded 
XIs displayed strong similarity in amino acid sequence, 
although they were isolated by using different methodol- 
ogies. This may be explained by the fact that most of 



the XI sequences that were used to identify homologous 
regions for the sequence-based screening originated 
from organisms that were know to be present in soil. 

Screening for Xl-encoding genes by growth versus 
nongrowth was successfully implemented in E. coli, 
since the expression of xyml and xym2 genes could 
restore growth in xylose minimal media with the same 
growth rate observed for the positive control overex- 
pressing the Piromyces XI gene. However, growth was 
restored when the same genes were transferred to the 
heterologous host S. cerevisiae, where the activity is 
required for xylose fermentation to ethanol, the growth 
rate was four times lower than that observed for Piro- 
myces XI. Expression of Xl-encoding genes in S. cerevi- 
siae is a complex issue [33]. The first one to be 
expressed at high levels in S. cerevisiae was Piromyces 
XI [18], and it occurred after many unsuccessful 
attempts and limited success with several other bacterial 
and plant XIs [19,21-24]. Successful expression of XI in 
S. cerevisiae and its growth rate on xylose are not always 
correlated to sequence similarity or enzymatic activity 
(Table 3). In our case, xyml -encoded XI had the same 
XI activity as Piromyces sp XI, but the growth rate was 
fourfold lower. In contrast, the specific activity of the 
,r)/m2-encoded XI was in the range of the background 
activity, but the growth rate was identical to the strain 
expressing xyml. The growth rate was also in the same 
range when the Xanthomonas campestris XI gene was 
expressed, although the reported activity range was 
about 20 times lower than that for the xyml -overexpres- 
sing strain (Table 3). Finally, a high activity level and 
growth rate were achieved when overexpressing a 
codon-optimised XI gene from Clostridium phytofer- 
mentans that shared only 54% similarity with Piromyces 
sp. at the protein level (Table 3) [16]. Differences in 
kinetic properties may also explain the discrepancies 
among all these results. For instance, the affinity for 
xylose may be much higher in xym2- than in xyml- 
encoded XI, so that the lower rate of conversion would 
be compensated for by a higher affinity for the substrate. 
Alternatively, the novel XI may be more inhibited by 
xylitol than the one from Piromyces sp. or C. phytofer- 
mentans. We were not able to measure K m or K { for the 
new XIs, because the activities were below the detection 
limit. Transport may also explain differences in beha- 
viour between S. cerevisiae and E. coli, since active 
xylose transporters are present in E. coli [34,35] but 
missing in S. cerevisiae. 

E. coli has been the preferred choice as the host for 
screening metagenomic libraries [6] because numerous 
molecular tools are available and E. coli genetics are 
well elucidated. In our case, E. coli had the additional 
advantage that it is naturally able to utilize xylose, allow- 
ing the isolation of the desired activity without any 
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Table 3 Reported specific activity and maximum specific 


growth rate for XI genes cloned in Saccharomyces cerevisiae* 


Source of XI 


Activity (U/mg 
protein) 


Aerobic growth rate on 
xylose (hour 1 ) 


Source 


Identity with 
xyml XI 


Identity with 
xym2 XI 


Identity with 
Piromyces XI 


Metagenome xym1 b 


0.33 ± 0.05 


0.02 


This 
study 


100% 


67% 


61% 


Metagenome xym2 b 


0.20 ± 0.04 


0.02 


This 
study 


67% 


100% 


63% 


Piromyces sp b,c 


0.35 


0.07 


This 
study 


61% 


63% 


100% 


Piromyces sp. 


0.3 to 1.1 


0.22 


[18] 


61% 


63% 


100% 


Piromyces sp. c 


0.5 to 0.8 


0.02 


[17] 


61% 


63% 


100% 


Piromyces sp. c 


0.0538 


0.056 


[16] 


61% 


63% 


1 00% 


Orpinomyces sp. 


1.6 


0.01 


ri5i 

L 1 -'J 


61% 


61% 


94% 


Xanthomonas 


0.016 


0.02 


T441 


64% 


63% 


61% 


compestris 














Bacteroides 
thetoiotaomicron 


0.14 


n.m. 


[45] 


61% 


64% 


83% 


Clostridium 
phytofermentons c 


0.0344 


0.057 


[16] 


54% 


52% 


54% 


Clostridium 
thermosulfurogenes 


n.m. 


n.d. 


[46] 


51% 


49% 


49% 


Escherichia coli 


n.m. 


n.d. 


[19] 


50% 


52% 


48% 


Bacillus subtilis 


n.m. 


n.m. 


[47] 


48% 


48% 


47% 


Lactobacillus pentosus 


n.d. 


n.d. 


[47] 


48% 


47% 


46% 


Thermus thermophilus 


0.0007 to 0.012 


n.m. 


[48] 


31% 


30% 


29% 



a n.m., not measured; n.d., not detected. Sequence identity with xyml, xym2 and Piromyces XI gene is also given at the protein level. b Glucose-grown cells. 
c Codon-optimised gene. 



other limitation up- or downstream of XL The use of 
only one host, however, may limit the exploitation of 
microbial diversity for a given habitat due to, for 
instance, differences in codon utilization and/or mRNA 
and protein stability among different hosts. E. coli was, 
for instance, estimated to express only 40% of genes 
from diverse microbial origin [36]. In a previous study 
where growth-based screening was utilized for the isola- 
tion of novel D-3-hydroxybutyrate short-chain dehydro- 
genases from the soil metagenomic library, 25 positive 
clones were initially isolated when the library was 
screened in Sinorhizobium melioti; however, only one 
clone complemented the same phenotype in E. coli [32] . 
Screening of a soil metagenomic library performed in 
six different bacterial hosts identified only one case 
where the molecule-producing clone could be isolated 
using two different hosts [37]. 

Further screening of the soil metagenomic library 
described in this study should be performed in S. cerevi- 
siae to investigate more suitable XIs for ethanol produc- 
tion from xylose. This is complicated by the fact that, 
whereas in bacterial metagenomic libraries large frag- 
ments are cloned and theoretically all genes within the 
fragment are transcribed and translated, only the closest 
gene to the methylated cap on the 5' terminus of the 
mRNA is translated in yeast [38]. Construction of future 
libraries for use in yeast will have to combine the 



cloning of small fragments corresponding to single gene 
size with an efficient transformation protocol to accom- 
modate the larger colony numbers required to represent 
the same soil population. Alternatively, construction and 
screening of metagenomic cDNA libraries may be 
attempted; however, neither of these strategies has yet 
been reported. 

Conclusions 

In the present study, a soil metagenomic library was 
successfully constructed and screened for the first time 
by using sequence- and growth-based methods to isolate 
genes conferring xylose isomerase activity. It enabled 
the isolation of xyml and xym2 genes, respectively, that 
shared 67% identity at the protein level. Both genes 
could restore growth in E. coli and S. cerevisiae strains 
lacking the initial xylose catabolic pathway. However, 
the limited growth rates in S. cerevisiae highlight the 
importance of screening for XI activity in this host 
instead. 

Methods 

Plasmids and strains 

Plasmids and strains used in this study are listed in 
Table 1. Strains were stored as recommended by the 
manufacturer or as 20% glycerol stocks in liquid media 
at -80°C. 
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Cultivation conditions 

E. coli strains were plated in Luria broth (LB) media 
plates (10 g/L Tryptone, 5 g/L yeast extract and 10 g/L 
NaCl, agar 15 g/L) and grown at 37°C overnight. Anti- 
biotics were added when necessary to a final concentra- 
tion of 30 ug/mL kanamycin and 50 ug/mL ampicillin 
and streptomycin. Single E. coli colonies were inoculated 
in LB media (10 g/L Bacto Tryptone, 5 g/L yeast extract 
and 10 g/L NaCl) for approximately 12 hours at 37°C in 
an incubator with constant agitation at 180 rpm (Boule 
Medical AB, Stockholm, Sweden). 

For the metagenomic library screening, cells were pla- 
ted in defined SM3 media [39] supplemented with 11 g/ 
L xylose, 15 g/L agar, 1 mM isopropyl-P-D-thiogalacto- 
side (IPTG) and 50 ug/ml ampicillin. 

S. cerevisiae strains were grown on yeast nitrogen base 
(YNB) media plates (6.7 g/L Difco YNB without amino 
acids; Becton Dickinson, Sparks, MD, USA) supplemen- 
ted with 20 g/L glucose and 20 g/L agar. 

For growth kinetics on xylose, E. coli recombinant 
strains were pregrown in 20 mL of LB media at 180 
rpm and 30°C until the end of the exponential phase. 
Cells were washed in distilled water and transferred to 
500-mL Erlenmeyer flasks containing 50 mL of SM3 
media supplemented with 30 g/L xylose and 1 mM 
IPTG at a starting OD 620nm = 0.1. 

S. cerevisiae strains were pregrown in 500-mL Erlen- 
meyer flasks containing 50 mL of YNB medium supple- 
mented with 20 g/L glucose at 180 rpm and 30°C for 
approximately 12 hours. Cells were washed in distilled 
water and inoculated at OD 620n m = 0.2 in 100 mL of 
YNB medium supplemented with 50 g/L xylose in 1-L 
Erlenmeyer flasks and grown at 180 rpm and 30°C. 

Standard molecular procedures 

Standard DNA manipulation techniques were used [40]. 
Restriction and ligation enzymes were purchased from 
Fermentas (St. Leon-Rot, Germany). Primers were pur- 
chased from Eurofins MWG Operon (Ebersberg, Ger- 
many). DNA extraction from agarose gels and 
purification of PCR products were performed using the 
QIAquick extraction kit (Qiagen, Hilder, Germany). 
Plasmid DNA was prepared using the Bio-Rad Miniprep 
Kit (Bio-Rad, Hercules, CA, USA). Sequencing was per- 
formed at Eurofins MWG Operon. If not otherwise sta- 
ted, bacterial transformation was performed by heat 
shock with competent cells prepared by using the Inoue 
Method for Preparation and Transformation of Compe- 
tent E. coli as previously described [41]. Yeast transfor- 
mation was performed as previously described [42]. 

Construction of the soil metagenomic library 

Soil DNA was extracted from the upper layer of a gar- 
den in Hoor, southern Sweden, during Spring 2008. 



DNA extraction was performed using the PowerSoil 
DNA Isolation Kit (MoBio, Carlsbad, CA, USA). Isolated 
DNA was digested with BamHl and Mbol. Agarose gel 
electrophoresis was performed for purification of DNA 
fragments ranging between 2.0 and 6.0 kb to generate a 
small insert library with an average size of 4.0 kb. Vec- 
tor pRSETB was digested with BamHl and treated with 
alkaline phosphatase to prevent plasmid self-ligation. 
DNA fragments were ligated into pRSETB. Bacterial 
transformation was performed with ElectroTen-Blue 
competent cells (Stratagene, La Jolla, CA, USA) accord- 
ing to the manufacturer's instructions. The library size 
was estimated on the basis of the total number of E. coli 
clones minus the ones with empty plasmid. 

Cloning and expression of xym 1, xym2 and XI from 
Piromyces sp 

The isolated isomerase genes were cloned both into E. coli 
and S. cerevisiae expression vectors. The xyml gene was 
amplified with primers XYM1F and XYM1R, which con- 
tain restriction sites for BamHl and Sail, respectively 
(Table 2). After PCR, both the amplified gene and plas- 
mids pRSETB and p426TEF were digested. Plasmids with 
inserts obtained after ligation and bacterial transformation 
were sequenced and named pRSETB-Xyml and p426TEF- 
Xyml, respectively (Table 1). Cloning of the xym2 gene 
followed the same procedure as for xyml, with the excep- 
tion that EcoRl and Sail restriction sites were used, xyml 
and xym2 sequences will be publicly available when the 
corresponding patent application is published. 

A codon-optimised version of the Xl-encoding gene 
from Piromyces sp. was amplified by PCR utilizing plas- 
mid YeplacHXT-XI as template [26]: forward primer 5' 
GGA GGA TCC ATGGCTAAGGAATATTTTCCACAA 
ATTC 3' and reverse primer 5' TTGGAATTCTTACTG 
ATACATTGCAACAATAGCTTCG 3'. Restriction sites 
for BamHl and EcoRl are underlined in the forward and 
reverse primers, respectively. Plasmid p426TEF [43], 
pRSETB (Invitrogen, Carlsbad, CA, USA) and the PCR 
product were digested with BamHl and EcoRl and then 
ligated. Clones with inserts were sequenced prior to 
yeast transformation. 

Enzyme activities 

S. cerevisiae strains were grown in YNB medium supple- 
mented with 20 g/L glucose at 30°C to the early station- 
ary phase. E. coli strains were pregrown in LB media 
supplemented with 100 ug/ml ampicillin for about 5 
hours. After that, cells were washed with water and 
inoculated for an initial OD 620 nm = 0.2 in SM3 medium 
supplemented with 10 g/L glucose and 1 mM IPTG at 
37°C overnight. 

Yeast and bacteria strains were harvested by centrifu- 
gation at 5,000 x g for 5 minutes and washed once with 
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distilled water. Wet cells were suspended (0.5 mg/mL) 
in Y-per yeast or B-per bacteria extraction protein 
reagent (Pierce Biotechnology, Rockford, IL, USA) in a 
2-mL Eppendorf tube and incubated on a turning table 
at room temperature for 50 minutes. Cell debris was 
spun down for 5 minutes at 16,100 x g (Z160M table 
centrifuge; HERMLE Labortechnik, Wehingen, Ger- 
many). Protein concentration was determined using 
Coomassie Protein Assay Reagent (Pierce Biotechnol- 
ogy). Bovine serum albumin was used to determine the 
standard curve. 

XI activity was determined at 30°C using sorbitol 
dehydrogenase as previously described [18]. Assays were 
performed in a U-2000 spectrophotometer (Hitachi, 
Tokyo, Japan). Enzyme assays were performed in three 
biological replicates. 
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